The hybrid clustering approach combining lexical and link-based similarities has long suffered from the different properties of the underlying networks. We propose a method based on noun phrase extraction using natural language processing to improve the measurement of the lexical component. Term shingles of different lengths are created from each of the extracted noun phrases. Hybrid networks are built from a weighted combination of the two types of similarities, using seven different weights. We conclude that removing all single-term shingles provides the best results in terms of computational feasibility, comparability with bibliographic coupling, and performance in a community detection application.
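The shingling and weighting steps described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not confirm: shingles are taken to be contiguous term sequences within a noun phrase, lexical similarity is taken to be a cosine over shingle counts, and the hybrid similarity is taken to be a convex combination of the lexical and link-based values. The function names, the `min_len` parameter, and the example inputs are hypothetical.

```python
from collections import Counter
from math import sqrt


def shingles(noun_phrase, min_len=1):
    """Contiguous term sequences of a noun phrase with at least min_len terms.

    Assumption: a "term shingle" is a contiguous run of terms in the phrase.
    """
    terms = noun_phrase.lower().split()
    return [
        " ".join(terms[i:i + n])
        for n in range(min_len, len(terms) + 1)
        for i in range(len(terms) - n + 1)
    ]


def profile(noun_phrases, min_len=1):
    """Shingle counts for one document, given its extracted noun phrases."""
    return Counter(s for phrase in noun_phrases for s in shingles(phrase, min_len))


def cosine(a, b):
    """Cosine similarity between two shingle-count profiles."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def hybrid_similarity(lexical, link, weight):
    """Assumed convex combination of lexical and link-based similarity."""
    return weight * lexical + (1.0 - weight) * link


# Hypothetical noun phrases for two documents.
doc_a = ["hybrid document network", "lexical similarity"]
doc_b = ["document network", "citation similarity"]

# min_len=2 drops all single-term shingles before comparing documents.
lexical = cosine(profile(doc_a, min_len=2), profile(doc_b, min_len=2))

# Placeholder link-based value, e.g. from bibliographic coupling.
link = 0.5
print(hybrid_similarity(lexical, link, weight=0.5))
```

Setting `min_len=2` corresponds to removing all single-term shingles, so two documents are only lexically similar if they share a multi-term sequence. The seven weights used in the study are not listed in the abstract, so the `weight=0.5` above is only a placeholder.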
In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (pp. 234-...
Abstract: In this paper we present a method for the automatic term clustering. The method uses a hyb...
In this article we present an approach to the automatic discovery of term similarities, which may se...
Clustering of hybrid document networks combining citation based links with lexical similarities suff...
The introduction of textual analysis and the use of lexical similarities already proved an important...
In this paper, we introduce a new similarity measure between words, and a graph-based word clusterin...
We consider a challenging clustering task: the clustering of multi-word terms without document co-occ...
Discovering synonyms and other related words among the words in a document collection can be seen as...
Similarity analysis is a substantial issue in both corpus-based researches and language usages. This...
Most traditional text clustering methods are based on “bag of words” (BOW) representation based on ...
The present study presents a semi-automatic method for parsing and filtering of noun phrases from ci...